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Outdoor  Recreation  Surveys:   Length-of-Stay  Bias 
and  !ts  Correction  by  Computer  ^ 


Simple  random  samples  of  recreationists 
taken  at  recreation  sites  may  not  provide 
representative,  unbiased  estimates  of  visitor 
characteristics.  This  problem  has  been  point- 
ed out  before,  and  a  computational  procedure 
to  remove  the  effect  of  the  bias  has  been  sug- 
gested.1 A  computer  program  to  make  the  ad- 
justment is  reviewed  in  this  Note. 

The  bias  arises  in  on-site  surveys2  because 
the  probability  that  a  camper,  swimmer,  or 
other  recreationist  will  fall  in  the  sample  and 
be  interviewed  is  a  function  of  how  long  he 
is  in  the  campground,  near  the  beach,  and  so 
on.  Campers  who  stay  only  2  days,  for  ex- 
ample, are  only  one-tenth  as  likely  to  be 
sampled  as  campers  who  stay  20  days.  Where 
activities  or  attitudes  are  the  main  interest, 
this  sampling  of  visitor-days  may  be  appropri- 
ate. If  the  survey  is  made  to  estimate  charac- 
teristics of  visitors,  however,  the  summar- 
ized responses  must  be  corrected  for  sampling 
bias. 


Lucas,  Robert  C.  Bias  in  estimating  recreation- 
ists' length  of  stay  from  sample  interviews.  Jour. 
Forestry  61:  912-914.  1963.  The  problem  was 
also  recognized  in  Standards  for  Traveler 
Studies,  prepared  by  the  Western  Council  for 
Travel  Research,  University  of  Utah,  Salt  Lake 
City,  1963,  p.  23;  but  it  was  assumed  that  bias 
could  only  be  corrected  by  arbitrary  adjust- 
ments. 

Redefining  the  population  of  interest  by  taking 
a  sample  from  a  registration  list  or  at  an  area 
entrance  or  exit  point  avoids  this  bias  since 
each  recreationist  can  then  enter  the  sample 
only  once  on  each  visit.  If  the  target  population 
is  defined  as  visitors,  other  procedures  are 
necessary,  such  as  removing  registrations  on 
second  and  subsequent  visits. 


Past  recreation  surveys  carried  out  on- 
site  have  not  been  corrected  for  length-of- 
stay  bias.  This  is  unfortunate  because  uncor- 
rected surveys  overestimate  length  of  stay 
and  all  factors  associated  with  it.  For  exam- 
ple, we  know  that  people  who  live  farther 
from  a  site  generally  visit  it  less  often  but  for 
longer  periods  than  do  local  people.  Thus  an 
on-site  sample  of  visitors  overrepresents  these 
more  distant  people,  who  may  also  be  better 
paid,  more  educated,  have  different  attitudes, 
spend  more,  and  so  on. 

No  completely  satisfactory  way  of  avoid- 
ing this  bias  in  the  original  data  collection  is 
available.3  Because  the  bias  is  a  simple  func- 
tion of  length-of-stay,  however,  correction  for 
it  is  quite  straightforward.  If  each  interview 
is  weighted  by  the  inverse  of  the  length  of 
stay,    the    bias    disappears.4   This    weighting 


One  possible  way  would  involve  first  asking  a 
respondent  his  intended  length  of  stay,  and  then 
accepting  or  rejecting  him  for  interviewing  by 
some  randomizing  process  proportionate  to  the 
inverse  of  his  stay.  Every  tenth  10-day  visitor 
could  be  interviewed,  every  fifth  5-day  visitor, 
and  so  on.  This  toould  appear  to  work  but  it 
seems  clumsy.  Another  would  require  taking  a 
100-percent  sample  (a  census)  of  a  part  of  the 
study  area  for  a  portion  of  the  study  period: 
This  cluster  sampling  would  be  less  sensitive  to 
possible  variation  over  time  and  area,  however. 
To  be  precise,  the  amount  of  time  the  recre- 
ationist is  present  during  the  period  of  the  day 
on  which  sampliyig  occurs  should  be  used.  A 
camper  who  arrives  after  dark  and  breaks  camp 
early  the  next  morning,  for  an  extreme  example, 
has  zero  probability  of  being  interviewed  in 
most  surveys.  However,  this  refinement  is  dif- 
ficult to  make  in  most  situations,  and  appears 
to  add  little  accuracy. 
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scales  down  the  overrepresented  longer  stays 
relative  to  the  underrepresented  shorter 
stays.  Sampling  with  replacement  is  neces- 
sary; if  a  party  falls  in  the  sample  a  second 
time,  it  should  be  included  —  either  by  dupli- 
cating" the  previous  interview  or  by  reinter- 
viewing.  If  it  is  desired  to  have  the  total  num- 
ber of  observations  after  weighting  be  the 
same  as  the  number  in  the  original,  unweight- 
ed data,  a  constant  multiplier  can  be  incor- 
porated into  the  weighting  factor. 

While  computing  an  average  length-of- 
stay  requires  only  a  simple  count  of  the  ob- 
servations for  each  possible  stay,  computing 
other  sample  characteristics  that  must  be  cor- 
rected for  a  length-of-stay  bias  involves  tabu- 
lating the  frequency  of  each  response  within 
each  length-of-stay.  As  the  number  of  pos- 
sible responses  to  a  question,  the  number  of 
questions,  and  the  number  of  interviews  all 
increase,  the  mechanical  procedures  of  list- 
ing and  calculating  by  hand  become  over- 
whelming. 

A  computer  program  prepared  at  the 
Lake  States  Forest  Experiment  Station,  de- 
signated LSWTFREQ.  can  more  efficiently 
provide  the  type  of  information  suggested 
above :  weighted  frequency  distributions  with 
weights  determined  by  some  critical  variable, 
here  the  length-of-stay.  In  addition  to  speci- 
fying these  weights,  the  program  user  can 
select  another  variable  so  that  separate  fre- 
quency distributions  will  be  printed  within 
each  value  of  this  "classification"  variable. 
For  example,  the  user  might  obtain  a  fre- 
quency distribution,  corrected  for  a  length- 
of-stay  bias,  of  distances  traveled  for  each  of 
several  recreationist  income  classes. 

The  program  is  general  in  that  it  can  be 
applied  to  other  sorts  of  weightings  and  other 


types  of  studies,  provided  the  bias  or  required 
adjustment  can  be  keyed  to  the  value  of  some 
recorded  characteristic  of  the  sampled  popu- 
lation. For  example,  this  characteristic  might 
be  size  of  firm,  acres  of  land  owned,  or  dis- 
tance from  a  market.  In  each  case,  the  vari- 
able could  be  used  to  adjust  for  varying 
sampling  intensities  within  the  several  strata 
defined  by  the  values  of  this  weighting  vari- 
able. 

The  only  critical  programmed  restriction 
is  that  only  integer  codes  with  a  maximum 
value  of  100  can  be  processed.'  The  program 
is  written  in  FORTRAN  60,  specifically  for 
the  CDC  1604  computer  of  the  Numerical  An- 
alysis Center  of  the  University  of  Minnesota. 
It  can  be  used  with  minor  modifications  on 
most  computers  with  the  following  character- 
istics: 

(DA  FORTRAN  60  or  FORTRAN  II  or 
compatible  compiler  capable  of  utiliz- 
ing — 

( a )  Two-dimensional  arrays, 

( b )  A  variable  data  format  ( not  in 
source  program ) , 

( c )  An  input  variable  as  the  limit  of 
an  implied  DO  LOOP  for  read- 
ing the  remainder  of  that  input 
record; 

( 2 )  Core  storage  of  32,000  data  words  in 
addition  to  the  compiler  storage  re- 
quirement. 

A  detailed  description  and  listing  of  the 
program  is  available  from  the  Biometrical 
Section  of  this  Station. 


5  Measurements  coded  by  alphanumeric  or  other 
midtiple  punch  combinations  must  be  changed 
to  this  form. 


October  1965 


ROBERT  C/LUCAS 

Principal  Geographer 

DENNIS  LySCHWEITZER/ 
Associate  Statistician' 


